Search CORE

23 research outputs found

Semantic concept schema of the linear mixed model of experimental observations

Author: Eeuwijk Fred, van
Filipiak Katarzyna
Gonzalez-Beltran Alejandra N.
Krajewski Paweł
Markiewicz Augustyn
Millet Emilie J.
Rocca-Serra Philippe
Sansone Susanna Assunta
Ćwiek-Kupczyńska Hanna
Ławrynowicz Agnieszka
Publication venue
Publication date: 01/01/2020
Field of study

In the information age, smart data modelling and data management can be carried out to address the wealth of data produced in scientific experiments. In this paper, we propose a semantic model for the statistical analysis of datasets by linear mixed models. We tie together disparate statistical concepts in an interdisciplinary context through the application of ontologies, in particular the Statistics Ontology (STATO), to produce FAIR data summaries. We hope to improve the general understanding of statistical modelling and thus contribute to a better description of the statistical conclusions from data analysis, allowing their efficient exploration and automated processing.</p

Wageningen University & Research Publications

Oxford University Research Archive

Software Citation Implementation Challenges

The main output of the FORCE11 Software Citation working group (https://www.force11.org/group/software-citation-working-group) was a paper on software citation principles (https://doi.org/10.7717/peerj-cs.86) published in September 2016. This paper laid out a set of six high-level principles for software citation (importance, credit and attribution, unique identification, persistence, accessibility, and specificity) and discussed how they could be used to implement software citation in the scholarly community. In a series of talks and other activities, we have promoted software citation using these increasingly accepted principles. At the time the initial paper was published, we also provided guidance and examples on how to make software citable, though we now realize there are unresolved problems with that guidance. The purpose of this document is to provide an explanation of current issues impacting scholarly attribution of research software, organize updated implementation guidance, and identify where best practices and solutions are still needed

arXiv.org e-Print Archive

Institute of Transport Research:Publications

Community standards for open cell migration data

Author: Ampe Christophe
Bakker Gert-Jan
Besson Sébastien
Eibl Robert H.
Friedl Peter
Gonzalez-Beltran Alejandra N.
Gunzer Matthias
Kittisopikul Mark
Le Dévédec Sylvia E.
Leo Simone
Martens Lennart
Masuzzo Paola
Moore Josh
Paran Yael
Prilusky Jaime
Rocca-Serra Philippe
Roudot Philippe
Sansone Susanna-Assunta
Schuster Marc
Sergeant Gwendolien
Strömblad Staffan
Swedlow Jason R.
van Erp Merijn
Van Troys Marleen
Zaritsky Assaf
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2020
Field of study

Cell migration research has become a high-content field. However, the quantitative information encapsulated in these complex and high-dimensional datasets is not fully exploited owing to the diversity of experimental protocols and non-standardized output formats. In addition, typically the datasets are not open for reuse. Making the data open and Findable, Accessible, Interoperable, and Reusable (FAIR) will enable meta-analysis, data integration, and data mining. Standardized data formats and controlled vocabularies are essential for building a suitable infrastructure for that purpose but are not available in the cell migration domain. We here present standardization efforts by the Cell Migration Standardisation Organisation (CMSO), an open community-driven organization to facilitate the development of standards for cell migration data. This work will foster the development of improved algorithms and tools and enable secondary analysis of public datasets, ultimately unlocking new knowledge of the complex biological process of cell migration

Ghent University Academic Bibliography

Leiden University Scholary Publications

ePubs: the open archive for STFC research publications

Juelich Shared Electronic Resources

University of Dundee Online Publications

FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows

Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily damaged and is often low, with cynicism arising where claims of "following the science" are made without accompanying evidence. Tracing the provenance of such decisions back through open software to primary data would clarify this evidence, enhancing the transparency of the decision-making process. Here, we demonstrate a Findable, Accessible, Interoperable and Reusable (FAIR) data pipeline developed during the COVID-19 pandemic that allows easy annotation of data as they are consumed by analyses, while tracing the provenance of scientific outputs back through the analytical source code to data sources. Such a tool provides a mechanism for the public, and fellow scientists, to better assess the trust that should be placed in scientific evidence, while allowing scientists to support policy-makers in openly justifying their decisions. We believe that tools such as this should be promoted for use across all areas of policy-facing research

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

PubMed Central

Edinburgh Research Explorer

Enlighten

White Rose Research Online

Hal-Diderot

SRUC - Scotland's Rural College

The health care and life sciences community profile for dataset descriptions

Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. However, while there are many relevant vocabularies for the annotation of a dataset, none sufficiently captures all the necessary metadata. This prevents uniform indexing and querying of dataset repositories. Towards providing a practical guide for producing a high quality description of biomedical datasets, the W3C Semantic Web for Health Care and the Life Sciences Interest Group (HCLSIG) identified Resource Description Framework (RDF) vocabularies that could be used to specify common metadata elements and their value sets. The resulting guideline covers elements of description, identification, attribution, versioning, provenance, and content summarization. This guideline reuses existing vocabularies, and is intended to meet key functional requirements including indexing, discovery, exchange, query, and retrieval of datasets, thereby enabling the publication of FAIR data. The resulting metadata profile is generic and could be used by other domains with an interest in providing machine readable descriptions of versioned datasets

Carleton University's Institutional Repository

The FAIR Guiding Principles for scientific data management and stewardship

Author: Aalbersberg I.J. (Ijsbrand Jan)
Appleton G. (Gabrielle)
Axton M. (Myles)
Baak A. (Arie)
Blomberg N. (Niklas)
Boiten J.W. (Jan-Willem)
Bourne P.E. (Philip)
Bouwman J. (Jildau)
Brookes A.J. (Anthony)
Clark T. (Tim)
Crosas M. (Mercè)
Dillo I. (Ingrid)
Dumon O. (Olivier)
Dumontier M. (Michel)
Edmunts S. (Scott)
Evelo C.T. (Chris)
Finkers R. (Richard)
Goble C.A. (Carole Ann)
Gonzalez-Beltran A. (Alejandra)
Gray A. (Alastair)
Grethe S. (Jeffrey)
Groth P. (Paul)
Heringa J. (Jaap)
Hoen P.A.C. (Peter) 't
Hooft R. (Rob)
Kok J. (Joost)
Kok R. (Ruben)
Kuhn T. (Tobias)
Lei J. (Johan) van der
Lusher S.J. (Scott)
Martone M.E. (Maryann)
Mons A. (Albert)
Mons B. (Barend)
Mulligen E.M. (Erik) van
Packer A. (Abel)
Persson B. (Bengt)
Roca-Serra P. (Philippe)
Roos M. (Marco)
Sansone S.A. (Susanna-Assunta)
Schaik R. (Rene) van
Schultes E. (Erik)
Sengstag T. (Thierry)
Silva Santos L.B. (Luiz Bonino) da
Slater T. (Ted)
Strawn G. (George)
Swertz M. (Morris)
Thompson M. (Mark)
Velterop J. (Jan)
Waagmeester A. (Andra)
Wilkinson J.M. (Mark)
Wittenburg P. (Peter)
Wolstencroft K. (Katherine)
Zhao J. (Jun)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/03/2016
Field of study

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community

Erasmus University Digital Repository

The Ontology for Biomedical Investigations

Author: A Gangemi
A González-Beltrán
A González-Beltrán
A González-Beltrán
Alan Ruttenberg
Alejandra Gonzalez-Beltran
Allyson L. Lister
Anita Bandrowski
AR Jones
B Smith
B Smith
Barry Smith
Bill Bug
Bjoern Peters
BP Evren Sirin
CA Ball
Carlo Torniai
CF Taylor
Chris F. Taylor
Christian J. Stoeckert
CJ Mungall
D Field
D Field
D Schober
Daniel Schober
DG Thomas
Dirk Derom
E Maguire
Elisabetta Manduchi
EW Sayers
Frank Gibson
G Rustici
Gilberto Fragoso
Helen Parkinson
James A. Overton
James Malone
JB Gutierrez
Jennifer Fostel
Jessica A. Turner
Jie Zheng
K Degtyarenko
K Haug
Kevin Clancy
Larisa N. Soldatova
Liju Fan
LN Soldatova
LN Soldatova
LN Soldatova
M Ashburner
M Courtot
M Horridge
M Musen
MA Haendel
Marcus C. Chibucos
Mark Jensen
Mathias Brochhausen
Matthew H. Brush
MC Byrne
MC Chibucos
Melissa A. Haendel
Mervi Heiskanen
Michel Dumontier
Monnie McGee
Mélanie Courtot
Norman Morrison
P Grenon
P Kohonen
P Rocca-Serra
P Rocca-Serra
P Rocca-Serra
Patricia L. Whetzel
Philippe Rocca-Serra
Phillip Lord
PL Whetzel
PL Whetzel
PT Spellman
R Arp
R Leinonen
R Vita
Randi Vita
Richard H. Scheuermann
Ryan Brinkman
S Orchard
S-A Sansone
SA Sansone
SA Sansone
Susanna-Assunta Sansone
TF Rayner
Tina Hernandez-Boussard
TP Sneddon
VG Dugan
Y Kazakov
Yongqun He
Yu Lin
Yu Xue
Z Xiang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies that provide a representation of biomedical knowledge from the Open Biological and Biomedical Ontologies (OBO) project and adds the ability to describe how this knowledge was derived. We here describe the state of OBI and several applications that are using it, such as adding semantic expressivity to existing databases, building data entry forms, and enabling interoperability between knowledge resources. OBI covers all phases of the investigation process, such as planning, execution and reporting. It represents information and material entities that participate in these processes, as well as roles and functions. Prior to OBI, it was not possible to use a single internally consistent resource that could be applied to multiple types of experiments for these applications. OBI has made this possible by creating terms for entities involved in biological and medical investigations and by importing parts of other biomedical ontologies such as GO, Chemical Entities of Biological Interest (ChEBI) and Phenotype Attribute and Trait Ontology (PATO) without altering their meaning. OBI is being used in a wide range of projects covering genomics, multi-omics, immunology, and catalogs of services. OBI has also spawned other ontologies (Information Artifact Ontology) and methods for importing parts of ontologies (Minimum information to reference an external ontology term (MIREOT)). The OBI project is an open cross-disciplinary collaborative effort, encompassing multiple research communities from around the globe. To date, OBI has created 2366 classes and 40 relations along with textual and formal definitions. The OBI Consortium maintains a web resource (http://obi-ontology.org) providing details on the people, policies, and issues being addressed in association with OBI. The current release of OBI is available at http://purl.obolibrary.org/obo/obi.owl

PhilPapers

Public Library of Science (PLOS)

Maastricht University Research Portal

Goldsmiths Research Online

Crossref

Directory of Open Access Journals

PubMed Central

Oxford University Research Archive

Brunel University Research Archive

FigShare

Software Citation Implementation Challenges

Institute of Transport Research:Publications

ISA API : An open platform for interoperable life science experimental metadata

Author: Batista Dominique
Cochrane Keeva
Davey Robert P.
Etuk Anthony
Gonzalez-Beltran Alejandra
Haug Kenneth
Izzo Massimiliano
Johnson David
Larralde Martin
Lawson Thomas N.
Minotto Alice
Moreno Pablo
Nainala Venkata Chandrasekhar
O'Donovan Claire
Pireddu Luca
Rocca-Serra Philippe
Roger Pierrick
Sansone Susanna-Assunta
Shaw Felix
Steinbeck Christoph
Weber Ralf J. M.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2021
Field of study

Background. The Investigation/Study/Assay (ISA) Metadata Framework is an established and widely used set of open source community specifications and software tools for enabling discovery, exchange, and publication of metadata from experiments in the life sciences. The original ISA software suite provided a set of user-facing Java tools for creating and manipulating the information structured in ISA-Tab—a now widely used tabular format. To make the ISA framework more accessible to machines and enable programmatic manipulation of experiment metadata, the JSON serialization ISA-JSON was developed.Results. In this work, we present the ISA API, a Python library for the creation, editing, parsing, and validating of ISA-Tab and ISA-JSON formats by using a common data model engineered as Python object classes. We describe the ISA API feature set, early adopters, and its growing user community.Conclusions. The ISA API provides users with rich programmatic metadata-handling functionality to support automation, a common interface, and an interoperable medium between the 2 ISA formats, as well as with other life science data formats required for depositing data in public databases

Publikationer från Uppsala Universitet

Oxford University Research Archive

Digitala Vetenskapliga Arkivet - Academic Archive On-line